Input file flhl4926cons; Output File flhl4926tra 
Sequence length 2818 

CCACGCCTCCGATTTATAGAAAGAAGCTATAGTC 

CTGGAAAGCCTTACAAGACTGAGGAATATCAGACTGCGAATCACCGGGAACGGTTC 

TCTCCCCATCTTCGCATATTCTAATGGCAAAACAAGTGGAAGAAAAGAGGAAGCATGACTGCAGATCAGATCAGTTCTC 

TTTGTGGATTATATTTTCAGTAAAATGTATGGATCTAT^ 

MANYSHAADNILQ 13 

GGCTGTATCCTTATCCTCCATCCATCT ATG GCG AAC TAT AGC CAT GCA GCT GAC AAC ATT TTG CAA 39 

NLSPLTAFLKLTSLGFIIGV 33 

AAT CTC TCG CCT CTA ACA GCC TTT CTG AAA CTG ACT TCC TTG GGT TTC ATA ATA GGA GTC 99 

SVVGNLLISILLVKDKTLHR 53 

AGC GTG GTG GGC AAC CTC CTG ATC TCC ATT TTG CTA GTG AAA GAT AAG ACC TTG CAT AGA 159 

APYYFLLDLCCSDILRSAIC 73 

GCA CCT TAC TAC TTC CTG TTG GAT CTT TGC TGT TCA GAT ATC CTC AGA TCT GCA ATT TGT 219 

FPFVFNSVKNGSTWTYGTLT 93 

TTC CCA TTT GTG TTC AAC TCT GTC AAA AAT GGC TCT ACC TGG ACT TAT GGG ACT CTG ACT 279 

CKVIAFLGVLSCFHTAFMLF 113 

TGC AAA GTG ATT GCC TTT CTG GGG GTT TTG TCC TGT TTC CAC ACT GCT TTC ATG CTC TTC 339 

CISVTRYLAIAHHRFYTKRL 133 

TGC ATC AGT GTC ACC AGA TAC TTA GCT ATC GCC CAT CAC CGC TTC TAT ACA AAG AGG CTG 399 

TFWTCLAVICMVWTLSVAMA 153 

ACC TTT TGG ACG TGT CTG GCT GTG ATC TGT ATG GTG TGG ACT CTG TCT GTG GCC ATG GCA 459 

FPPVLDVGTYSFIREEDQCT 173 

TTT CCC CCG GTT TTA GAC GTG GGC ACT TAC TCA TTC ATT AGG GAG GAA GAT CAA TGC ACC 519 

FQHRSFRANDSLGFMLLLAL 193 

TTC CAA CAC CGC TCC TTC AGG GCT AAT GAT TCC TTA GGA TTT ATG CTG CTT CTT GCT CTC 579 

ILLATQLVYLKLIFFVHDRR 213 

ATC CTC CTA GCC ACA CAG CTT GTC TAC CTC AAG CTG ATA TTT TTC GTC CAC GAT CGA AGA 639 

KMKPVQFVAAVSQNWTFHGP 233 

AAA ATG AAG CCA GTC CAG TTT GTA GCA GCA GTC AGC CAG AAC TGG ACT TTT CAT GGT CCT 699 

GASGQAAANWLAGFGRGPTP 253 

GGA GCC AGT GGC CAG GCA GCT GCC AAT TGG CTA GCA GGA TTT GGA AGG GGT CCC ACA CCA 759 

PTLLGIRQNANTTGRRRLLV 273 

CCC ACC TTG CTG GGC ATC AGG CAA AAT GCA AAC ACC ACA GGC AGA AGA AGG CTA TTG GTC 819 

LDEFKMEKRISRMFYIMTFL 293 

TTA GAC GAG TTC AAA ATG GAG AAA AGA ATC AGC AGA ATG TTC TAT ATA ATG ACT TTT CTG 879 

FLTLWGPYLVACYWRVFARG 313 

TTT CTA ACC TTG TGG GGC CCC TAC CTG GTG GCC TGT TAT TGG AGA GTT TTT GCA AGA GGG 939 

PVVPGGFLTAAVWMSFAQAG 333 

CCT GTA GTA CCA GGG GGA TTT CTA ACA GCT GCT GTC TGG ATG AGT TTT GCC CAA GCA GGA 999 

INPFVCIFSNRELRRCFSTT 353 

ATC AAT CCT TTT GTC TGC ATT TTC TCA AAC AGG GAG CTG AGG CGC TGT TTC AGC ACA ACC 1059 

LLYCRKSRLPREPYCVI 371 

CTT CTT TAG TGC AGA AAA TCC AGG TTA CCA AGG GAA CCT TAC TGT GTT ATA TGA 1113 
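
The pairing of residue rows with codon rows in the listing above can be spot-checked by translating a codon row with the standard genetic code. The following is a minimal sketch, not the program that produced the listing; the helper name `translate_row` is hypothetical, and the two rows are copied from the first and last codon rows of the open reading frame.

```python
from itertools import product

# Standard genetic code, enumerated with bases in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_row(codon_row: str) -> str:
    """Translate a space-separated row of codons into one-letter residues."""
    return "".join(CODON_TABLE[codon] for codon in codon_row.split())


# First and last codon rows of the open reading frame, as printed above.
first_row = "ATG GCG AAC TAT AGC CAT GCA GCT GAC AAC ATT TTG CAA"
last_row = "TGT GTT ATA TGA"

print(translate_row(first_row))
print(translate_row(last_row))
```

A row whose translation disagrees with the residue line printed above it points to a character that was misread in either the codon row or the residue row.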



GGGAGCATCTGTAAATCTTTAGCCTTGTGAAAACTAACCTTCTCTGCTG 
AAGAAATTCAAGAATGGAATCAGCAGTTTTAAGGATTTGGG^ 
CTATTTTAAATCTCAGAGTGATCCTGCTGACTGCCAGCAAA^ 
TTTCTTTATGTGGTCAAAAACTAGATAATGAAAGTAGCAGGTGCTA 

ATATGAAAAAACATCAAAAAACAATTAGCATTGGACATCTTAATAAATTAAGTTGACATGAGGTO 
AACTAATTTTAGAAGTTTGAAGACTTTAAAACATTT 

AAAGTACTGTAATCCACTAAAGACGTGCCAATGAATTATTGGAATATCACACTTTAAAAACCGCCTTGTAAGTTCTGGG 

GAGCATTCCAAAGCAGTATATTGGTTCCAATTAGAGTTTACTT^^ 
TTTCCTCATCTACTAGTAAGATTGCTAGCATTGAACTGTATTATGTGGTTl^ 

TTCATTTATATTTTACAAATGCTAGATATTGGTCTGGGAGGCAA^ 

CTAATAATGCAGAATAAACACATGTTGCCTTAAAGGGTTATCTAGKATCCYTTCATCTTATTAGCACTGGAGCAAA/TAG 

YCAAGGGAAATCRAJVTCAGTAACTGGTCATGGTCATGCATCTRAAAGTGCATGGAAGATCATTTAGTACTTTTTCCTTT 

TTTCTCACATGGTTTGAAACTTAAAGTGCACATCM^ 

ASTGTTCTAAGAAGCAGGCAGTTGATGTATGTTTATATTTTAAGTCAGCTGTCGA 

CATCCTGCACAJ^TTTGTGAAGCATTTATTCTACTGAAGGCACAGTCTTGTTTATACTTTCTGCACATTCAGTGTATTGG 

TCATTTAAATTATTTaAGTTTTAACTTGTGAAAGCTT 

GAGTCTCATTCTTTAAGATACANATGTGTGAACTTCAATATAAAGTTGC 

AAAAAAAAAAA 
Protein Family/Domain HMM Matches for flhl4926orfaa 

> PF00001 7tm_1 7 transmembrane receptor (rhodopsin family) 

Score: 146.01 Seq: 37 339 Model: 1 269 

*GNiLVIWvIcRyRRMRTPMNYFIvNLAvADLLFslftMPFWMvYyvMqg 
GN+L+ + + + + + + + + + ++YF++ L + D+L S + + PF + + + + 
flhl4926or 37 GNLLISILLVKDKTLHRAPYYFLLDLCCSDILRSAICFPFVFNSVKNGS 85 

RWpFGdfMCrIWmYFDYMNMYASIFfLTcISIDRYLWAICHPMrYmRWMT 
+W++G++ C+++ ++ + + + + + F+L CIS+ RYL AI H Y + T 
flhl4926or 86 TWTYGTLTCKVIAFLGVLSCFHTAFMLFCISVTRYL-AIAHHRFYTKRLT 134 

pRHRAWvMIiilWvMSFHSMPPFLMFr.WstyrDEneWNmTWCmlyDWP 
+ ++++I+++W++S+++++PP+L + +S+ R E++ C + + + 
flhl4926or 135 FW-TCLAVICMVWTLSVAMAFPPVLDVGTYSFIREEDQ CTF-Q-- 175 

ewMWrWYvILmtiimgFYIPMilMlFCYwRIYRIaRlWMRMIpswQr... 
+ R+ + +GF++ + ++L ++Y + ++ + ++ 
flhl4926or 176 HRSFR-ANDS-LGFMLLLALILLATQLVYLKLIFFVHDR RKMKP 217 

flhl4926or 218 VQFVAAVSQNWTFHGPGASGQAAANWLAGFGRGPTPPTLLGIRQNANTTG 267 

RRrmS....mRrERRivKMliIIMvVFIICWlPYFIvmfMDTLMMwwFCe 
RRR+ + + + E+RI + +M I+ + + F+ W+PY ++ + + + +F 
flhl4926or 268 RRRLLVLDEFKMEKRISRMFYIMTFLFLTLWGPYLVACY WRVFAR 312 

fC.IwrrlWmYIfeWLaYvNCpCiNPIIY* 
++ + +++W++++ INP++ 
flhl4926or 313 GPVVPGGFLT-AAVWMSFAQA-GINPFVC 339 

> MILPAT00G28 ngf NGF / BDNF / Neurotrophins 3, 4, and 6 family of cytokines 

Score: 0.47 Seq: 290 302 Model: 1 13 

REF xxxxxxxxxxxxx 
*MSMLFYTMFIsYF* 
M+ LF+T+ +Y+ 
flhl4926or 290 MTFLFLTLWGPYL 302 
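
Each aligned query row above carries a start and an end coordinate, and the end should equal the start plus the number of ungapped residues minus one. The sketch below checks that bookkeeping. The helper name `ungapped_span` is hypothetical, and treating `-`, `.` and spaces as gap characters is an assumption about the alignment format.

```python
from typing import Tuple


def ungapped_span(start: int, aligned: str, gap_chars: str = "-. ") -> Tuple[int, int]:
    """Return (start, end) of an aligned query row, ignoring gap characters."""
    residue_count = sum(1 for ch in aligned if ch not in gap_chars)
    return start, start + residue_count - 1


# Query rows copied from the matches above, with their printed start positions.
rows = [
    (86, "TWTYGTLTCKVIAFLGVLSCFHTAFMLFCISVTRYL-AIAHHRFYTKRLT", 134),
    (176, "HRSFR-ANDS-LGFMLLLALILLATQLVYLKLIFFVHDR RKMKP", 217),
    (290, "MTFLFLTLWGPYL", 302),
]

for start, aligned, printed_end in rows:
    _, computed_end = ungapped_span(start, aligned)
    print(start, computed_end, computed_end == printed_end)
```

A mismatch between the computed and printed end suggests that a residue was dropped or merged when the row was transcribed.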
Alpha, Regions - Garnier-Robson 
Alpha, Regions - Chou-Fasman 
Beta, Regions - Garnier-Robson 
Beta, Regions - Chou-Fasman 
Turn, Regions - Garnier-Robson 
Turn, Regions - Chou-Fasman 
Coil, Regions - Garnier-Robson 

Hydrophilicity Plot - Kyte-Doolittle 

Alpha, Amphipathic Regions - Eisenberg 
Beta, Amphipathic Regions - Eisenberg 
Flexible Regions - Karplus-Schulz 

Antigenic Index - Jameson-Wolf 

Surface Probability Plot - Emini 
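
As an illustration of one of the profiles named in the legend, a Kyte-Doolittle profile is a sliding-window average of per-residue hydropathy values. The sketch below uses the published Kyte-Doolittle scale; the window size of 7 and the function name `windowed_hydropathy` are assumptions, and the source does not state which window or sign convention the plotting program used for its hydrophilicity plot.

```python
# Kyte-Doolittle hydropathy scale (positive values are hydrophobic).
KD_SCALE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def windowed_hydropathy(seq: str, window: int = 7) -> list:
    """Average the hydropathy values over each full window along the sequence."""
    scores = [KD_SCALE[residue] for residue in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# First 33 residues of the translated sequence, taken from the listing.
fragment = "MANYSHAADNILQNLSPLTAFLKLTSLGFIIGV"
profile = windowed_hydropathy(fragment)
print(len(profile), round(max(profile), 2))
```

Stretches of roughly twenty residues with a high average are the usual signature of a membrane-spanning helix, which is what one would expect to see for a protein matching the 7 transmembrane receptor family.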
Analysis of flhl4926orfaa (371 aa) 

1 41 81 121 161 201 241 281 321 361 

>flhl4926orfaa 

MANYSHAADNILQNLSPLTAFLKLTSLGFIIGVSVVGNLLISILLVKDKTLHRAPYYFLL 
DLCCSDILRSAICFPFVFNSVKNGSTWTYGTLTCKVIAFLGVLSCFHTAFMLFCISVTRY 
LAIAHHRFYTKRLTFWTCLAVICMVWTLSVAMAFPPVLDVGTYSFIREEDQCTFQHRSFR 
ANDSLGFMLLLALILLATQLVYLKLIFFVHDRRKMKPVQFVAAVSQNWTFHGPGASGQAA 
ANWLAGFGRGPTPPTLLGIRQNANTTGRRRLLVLDEFKMEKRISRMFYIMTFLFLTLWGP 
YLVACYWRVFARGPVVPGGFLTAAVWMSFAQAGINPFVCIFSNRELRRCFSTTLLYCRKS 
RLPREPYCVI* 



FIGURE A 



# prositescan will scan one or more sequences against a set of sequence patterns 

Database: Release 12.2 of February 1995 
Tue Apr 7 18:49:19 1998 
1025 patterns 

Query= flhl4926orfaa 

>PS00001|PDOC00001|ASN_GLYCOSYLATION N-glycosylation site 
N[^P][ST][^P] 

Query: 3 NYSH 6 
Query: 83 NGST 86 
Query: 182 NDSL 185 
Query: 227 NWTF 230 
Query: 264 NTTG 267 



>PS00004|PDOC00004|CAMP_PHOSPHO_SITE cAMP- and cGMP-dependent protein kinase phosphorylation site. 
[RK]{2}[A-Z][ST] 

Query: 131 KRLT 134 
Query: 281 KRIS 284 

>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site 
[ST][A-Z][RK] 

Query: 80 SVK 82 
Query: 93 TCK 95 
Query: 130 TKR 132 
Query: 178 SFR 180 
Query: 266 TGR 268 
Query: 342 SNR 344 



>PS00006|PDOC00006|CK2_PHOSPHO_SITE Casein kinase II phosphorylation site 
[ST][A-Z]{2}[DE] 

Query: 342 SNRE 345 

>PS00008|PDOC00008|MYRISTYL N-myristoylation site. 
G[^EDRKHPFYW][A-Z]{2}[STAGCN][^P] 

Query: 84 GSTWTY 89 
Query: 90 GTLTCK 95 
Query: 101 GVLSCF 106 
Query: 237 GQAAAN 242 
Query: 258 GIRQNA 263 
Query: 318 GGFLTA 323 

>PS00009 | PDOC00009 | AMIDATION Amidation site. 
[A-Z]G[RK][RK] 

Query: 266 TGRR 269 
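
The pattern hits listed above can be reproduced with ordinary regular expressions. The sketch below is not the prositescan program itself. It assumes that each bracketed exclusion in a pattern is a negated character class (written `[^...]` in Python), and the helper name `scan` is hypothetical. A lookahead is used so that overlapping hits are reported. The two fragments are residues 1 to 33 and 74 to 93 of the translated sequence.

```python
import re

# Patterns written as Python regular expressions (negation assumed to be "^").
PATTERNS = {
    "ASN_GLYCOSYLATION": "N[^P][ST][^P]",
    "PKC_PHOSPHO_SITE": "[ST][A-Z][RK]",
    "MYRISTYL": "G[^EDRKHPFYW][A-Z]{2}[STAGCN][^P]",
}


def scan(pattern: str, seq: str, offset: int = 1) -> list:
    """Return (start, matched_text, end) for every hit, using 1-based positions.

    `offset` is the sequence position of the first residue in `seq`.
    """
    hits = []
    for match in re.finditer(f"(?=({pattern}))", seq):
        text = match.group(1)
        start = match.start() + offset
        hits.append((start, text, start + len(text) - 1))
    return hits


fragment_1 = "MANYSHAADNILQNLSPLTAFLKLTSLGFIIGV"  # residues 1-33
fragment_2 = "FPFVFNSVKNGSTWTYGTLT"               # residues 74-93

print(scan(PATTERNS["ASN_GLYCOSYLATION"], fragment_1))
for name, pattern in PATTERNS.items():
    print(name, scan(pattern, fragment_2, offset=74))
```

In the first fragment the candidate at position 14 (NLSP) is rejected because the fourth position is proline, which is why only the site at positions 3 to 6 is reported for that stretch.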